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SPEEDING  UP  THE  HELP  HYDRODYNAMIC  CODE 


1.  INTRODUCTION 


The  two-dimensional  Eulerian  hydrodynamic  computer  code  HEIP 
(Hydrodynamic  Elastic  _Plastic)  has  been  employed  at  the  Materials  Research 
laboratories  (MRL)  for  some  years  to  model  explosive-metal  interactions  in 
warhead  research.  HE If  was  developed  in  the  US  during  the  1970's,  and  is 
well  described  in  Reference  [1],  The  version  of  HE If  in  use  at  MRL  [2]  was 
obtained  from  the  US  Army  Ballistic  Research  laboratory  through  the  auspices 
of  TTCP. 


HE If  suffers  the  common  disadvantage  of  Eulerian  codes  in  that  it 
demands  long  computer  run-times  to  solve  useful  problems.  At  MR  I,  HE If  has 
largely  been  run  on  a  CDC-7600  computer  on  the  CSIRONET  network.  For  warhead 
problems  such  as  shaped  charge  collapse,  3  to  4  hours  CPU  time  on  the  7600 
would  be  required  for  a  typical  run.  This  is  not  only  expensive  but  also 
inconvenient,  as  run-times  of  this  length  exceed  user  limits  on  CSIRONET,  and 
can  only  be  achieved  by  stopping  and  restarting  at  a  later  date. 

With  these  factors  in  mind,  it  was  decided  to  investigate  the 
possibility  of  improving  the  efficiency  of  HE IP,  without  changing  its  basic 
modus  operandi.  This  report  describes  the  analysis  of  the  code's  run-time 
performance,  and  presents  a  modification  to  HE If  which  increases  its  speed  up 
to  threefold. 


2.  ANALYSIS  OF  HELP  PERFORMANCE 


2.1  Run-time  Profiles  by  Subroutine 

The  first  approach  to  analysing  the  performance  of  HE If  was  to 
establish  which  of  the  subroutines  were  absorbing  most  of  the  computer  time. 
This  was  achieved  using  the  P-register  sampling  utilities  (PREGON,  PRLIST) 


available  on  the  7600  under  the  SCOPE  operating  system  [3].  These  utilities 
record  the  programme  address  under  execution  every  3.6  milliseconds,  when  the 
hardware  clock  interrupts  the  executing  programme  for  system  housekeeping.  At 
the  end  of  execution,  these  data  are  processed  to  build  a  statistical  profile 
of  programme  addresses  vs  frequency.  A  loader  map  can  then  be  used  to 
associate  these  addresses  with  programme  segments. 

Two  fairly  typical  problems  previously  run  with  HELP  at  MRL  were 
chosen  and  re-run  with  P-register  analysis.  The  first  of  these  was  a  i .5 
inch  diameter  hemispherical  shaped  charge,  given  in  Reference  [2]  as  a  sample 
problem  for  the  BRL  version  of  HELP.  The  complete  input  deck  for  this  run  is 
given  in  Appendix  1.  (The  input  has  been  modified  slightly  from  that  in 
Reference  [2],  partly  due  to  memory  constraints,  and  partly  to  include  the 
strength  phase  of  HEIR  to  assess  its  time  consumption) .  Schematic 
illustrations  for  this  problem  at  the  beginning  and  end  of  the  HELP  run  may  be 
found  in  Figures  1  and  2  respectively.  The  second  study  chosen  was  a  38  mm 
shaped  charge  of  the  type  used  frequently  in  experimental  studies  at  MRL  [4]. 
The  input  deck  for  this  run  is  given  in  Appendix  2,  and  illustrations  are  at 
Figures  3  and  4. 

The  profile  analyses  for  these  two  jobs  are  presented  in  terms  of 
the  percentage  of  total  CPU  time  taken  by  each  subroutine.  The  hemisphere 
job,  run  1.023,  used  5720  seconds  of  7600  time,  and  its  analysis  is  shown  in 
Table  1 .  The  shaped  charge  job,  run  50.0009,  was  stopped  after  500  cycles 
and  used  6829  seconds  CPU  time;  its  analysis  is  shown  in  Table  2. 

The  profiles  reveal  that  in  both  runs  the  tracer  moving  subroutine 
MOVTCR  accounted  for  the  bulk  of  the  run-time.  This  routine  used  61.7%  of 
the  total  time  in  the  first  run,  and  68.8%  in  the  second.  clearly  a 
significant  improvement  to  the  overall  performance  of  HEIP  could  only  be 
achieved  by  improving  MOVTCR,  and  hence  this  subroutine  was  analysed  in 
detail. 


2.2  Analysis  of  Subroutine  MOVTCR 

Subroutine  MOVTCR  is  called  on  every  sub-cycle  of  HELP,  to  move  the 
massless  tracer  particles  which  define  material  boundaries.  MOVTCR  also 
moves  cell-centred  passive  tracers  if  these  have  been  selected.  Both  types 
of  tracer  are  moved  in  a  pseudo  -  continuous  local  velocity  field:  the  method 
is  described  in  detail  in  Section  4.1.2  of  Reference  [1]. 

Bi  order  to  establish  exactly  which  segments  of  MOVTCR  were 
consuming  the  computer  time,  a  further  P-register  analysis  run  was  performed. 
This  run  concentrated  on  the  load  addresses  occupied  by  MOVTCR.  A  cross- 
referenced  listing  then  enabled  identfication  of  the  offending  code.  It  was 
found  that  over  95%  of  the  subroutine's  time  was  spent  in  one  small  inner 
loop.  Appendix  3  contains  a  listing  of  MOVTCR:  the  loop  referred  to  is  DO 
1200,  between  MOVTRC.48  and  MOVTCR. 52. 

Understanding  the  function  of  this  loop  requires  a  little  knowledge 
of  the  workings  of  HEIP.  Each  material  package  is  defined  spatially  by  an 


2 


ordered  set  of  tracer  particles  which  form  the  package  boundaries.  These 
tracers  are  also  used  by  the  code  to  "sense"  whether  material  is  entering  or 
leaving  a  cell,  and  to  define  fractional  cell  areas  across  which  such 
transport  may  occur.  This  is  quite  the  most  complicated  logic  in  HELP.  Hie 
logic  requires  that  the  "void  package",  present  in  nearly  all  HELP  problems, 
have  its  own  set  of  tracer  particles.  These  tracers  must  be  coincident  with 
tracers  from  other  material  packages  but  are  kept  as  an  independent  ordered 
set.  When  tracer  particles  for  (non-void)  materials  are  moved  on  each  cycle 
or  sub-cycle,  the  corresponding  void  tracer  is  found  and  moved  identically. 

The  function  of  the  DO  1200  loop  in  MOVTCR  is  simply  to  find  whether  the 
tracer  being  considered  has  a  corresponding  void  tracer  which  needs  to  be 
moved.  Since  most  typical  HELP  runs  involve  in  the  order  of  1000  tracers  for 
each  package,  it  is  not  surprising  that  this  loop,  being  executed  each  sub¬ 
cycle  in  the  order  of  1,000,000  times,  proved  very  time-consuming. 


3.  MODIFICATIONS  TO  MOVTCR 


Since  coincident  tracers  are  moved  identically,  it  is  not  necessary 
to  check  for  these  every  cycle  or  sub-cycle,  because  they  will  remain 
associated.  The  only  way  this  association  could  alter  is  when  new  tracers 
are  added  or  existing  ones  removed.  This  can  only  happen  when  subroutine 
ADDTCR  is  called  to  interpolate  new  tracers  because  existing  ones  have  become 
sparse,  or  when  tracers  are  adjusted  manually  on  restarts  or  automatically  by 
VDCL0S,  the  void  closing  routine. 

The  solution  was  to  associate  each  void  package  tracer  with  a 
corresponding  material  tracer  and  store  this  information  at  the  time  of 
problem  set-up.  Further  checks  then  need  only  be  made  if  tracers  have  been 
added  or  deleted. 

To  do  this,  a  new  subroutine  TCRCHK  was  added  to  HELP.  This 
subroutine  checks  each  tracer  of  the  void  package  to  find  a  coincident 
material  package  tracer.  A  "pointer"  array  is  filled  so  that  each  void 
tracer  has  a  floating  point  number  associated  with  it.  The  fractional  part 
of  this  number  contains  the  material  package  number,  and  the  integer  part  the 
tracer  number  for  the  matching  tracer.  Subroutine  MOVTCR  now  moves  all  non¬ 
void  tracers  first,  and  then  moves  void  tracers  according  to  the  information 
encoded  in  this  array.  TCRCHK  is  called  on  problem  initialisation  and  then 
only  called  again  if  ADDTCR  adds  new  tracers,  if  VDCLOS  removes  tracers,  or  on 
problem  restart.  An  UPDATE  [5]  correction  run  to  incorporate  these 
modifications,  including  the  new  routine  TCRCHK,  is  given  in  Appendix  4. 

Whilst  it  is  not  claimed  that  this  solution  is  necessarily  the  most 
elegant,  its  attraction  is  that  it  makes  no  changes  to  the  existing  tracer 
particle  logic  in  HELP  and  hence  cannot  have  any  unforeseen  consequences.  It 
should  also  substantially  reduce  run-times  for  all  HEIP  simulations  which 
include  a  void  package. 


4.  RESULTS 


Hie  two  test  problems  chosen  in  Section  2.1  were  re-run  with  the 
modified  version  of  HELP.  Resultant  outputs  were  found  to  be  identical.  For 
the  hemisphere  calculation,  run-time  was  reduced  from  5720  seconds  to  2440 
seconds  (43%),  and  for  the  shaped  charge  calculation  from  6829  seconds  to  2386 
seconds  (35%).  Tables  3  and  4  show  subroutine  profile  analyses  for  these 
runs.  For  the  hemisphere  run  subroutines  MOVTCR  and  the  new  routine  TCRCHK 
together  accounted  for  6.9%  of  the  run-time,  compared  to  the  previous  figures 
of  61.7%.  For  the  shaped  charge  run  the  corresponding  figures  are  7.6%  and 
68.8%. 


It  seems  reasonable  to  assume  that  for  most  applications  of  HELP, 
this  modification  will  increase  the  code's  speed  by  a  factor  of  between  two 
and  three . 
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APPENDIX  1 


JOB  TO  RUN  HEMISPHERICAL  SHAPED  CHARGE  PROBLEM 


SCRUN (P10 ,T20000 ,MS140000 ,ML600) 

COMMENT.  JOB  TO  RUN  PROBLEM  1.023 
COMMENT.  SAMPLE  PROBLEM  OF  HEMISPHERE 
LIMIT  (0) 

GETSET (DFC4405) 

REQUEST (TAPE1 , *PF , SN=DFC4405) 

REQUEST (TAPE7 , *PF , SN=DFC4405) 

REQUEST (TAPE20 , *PF , SN=DFC4405) 

ATTACH, LGO,$LGO  FOR  MRLHLP20$,SN=DFC4405, ID=DFCDLS. 

MAP (PART) 

PREGON (FWA=0 , LWA=140000) 

LGO (PL=200000) 

EXIT  (U) 

PRLIST(,TAPE20,) 

CATALOG (TAPE20,$PRLI ST  JOB  1.023$,SN=DFC4405,ID=DFCDLS,RP=998) 

REWIND (TAPE20) 

COPY (TAPE20 , OUTPUT) 

EXIT (U) 

REWIND (TAPE7) 

ATTACH (LC02.9LG0  FOR  FI CHE  PLOTS?, SN=DFC4405, ID=DFCDLS) 

MAP (OFF) 

LG02 . 

EXIT  (U) 

REWIND (TAPE7) 

CATALOG (TAPE7 , 9TAPE7  JOB  1 .0239,SN=DFC4405, ID=DFCDLS,RP=998) 

EXIT (U) 

DISPOSE (FI CHE, *PR=CFC . ST=COMNM) 

DISPOSE (FICHE2. *FL=CFC,ST=COMNM) 

REQUEST (FRED, *PF , SN=DFC4405) 

FUSE. 

TITLE  (FICHE) 

TITLE (FICHE2) 

TRANZP (TAPE1 ,FL.FID=FICHE2) 

CATALOG (TAPE1,?ZP  FILE  JOB  1 .023?,SN=DFC4405, ID=DFCDLS,RP=999) 

EXIT  (U) 

REWIND (OUTPUT) 

COPY (OUTPUT, FICHE) 

EXIT (U) 

REWIND  (OUTPUT) 

COPY  (OUTPUT,  FRED) 

CATALOG (FRED, $OUTPUT  JOB  1.0239, SN=DFC4405 , ID=DFCDLS , RP=20 ) 

EXIT (U) 

DI SPOSE (OUTPUT . SC) 

*EOS 

SAMPLE  PROBLEM  2  -  COPPER  HEMISPHERE  LOADED  WITH  COMP  B  HE  UNCONFINED 
90PTNS  NPRTOP=2,  TIMMAX=8000,  IFLGST=1,  IPMADJ=1,  ITPHSE=1$ 

9START  PK(1)=1.02,  PK(2)=0,  PK(3)=-0$ 

9RUN  TSTOP=l .5E-04,  PRDELT=2 .0E-06,  KUNITW=7,  KUNITR=7,  NOSLIP=0, 
LVISC=1 ,  NMAT=2,  NMXCLS=600,  NTRACR=5,  NTPMX=900,  REZ=0 . , 
CYCPH3=1 ,  NLINER=1 ,  NSLD=300,  MAPS=1,  IMAX=60,  JMAX=132, 

DMIN-100 ,  IPR=100,  NFRELP=100,  NDUMP7=1,  IPCYCL=0,  11=3, 

12=53,  CVIS=-1,  IGM=09 
60  .05 
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TITLE2=$DAVID  L.  SMIIH$ 
TITLE3=$ WRITTEN  "DATE"$ 
INDEX=l/l/25 
•EOS 

TITLE l=$PLOTS  FOR  JOB  1.023$ 
TITLE 2=$DAVID  L.  SMITH$ 
TITLE3=$ WRITTEN  "DATE"$ 

PLOT 


0.0 

4.55E+11 

0.0 


APPENDIX  2 


JOB  TO  RUN  SHAPED  CHARGE  PROBLEM 


SCRUN (P 10 , T20000 , MS 140000 . ML600 ) 

COMMENT.  JOB  TO  RUN  PROBLEM  50.0009 

COMMENT.  PLANE  WAVE  DETONATION,  MRL  38  MM  CHARGE 

LIMIT  (0) 

GETSET (DFC4405) 

REQUEST (TAPE1 , *PF , SN=DFC4405) 

REQUEST (TAPE7 , *PF , SN=DFC4405) 

REQUEST (TAPE20, 'PF, SN=DFC4405) 

ATTACH, LGO,$LGO  FOR  MRLHLP20$ , SN=DFC4405, ID=DFCDLS . 

MAP  (PART) 

PREGON (FWA=0 , LWA=140000) 

LGO (PL=200000) 

EXIT (U) 

PRLIST(,TAPE20,) 

CATALOG (TAPE20,$PRLI ST  JOB  50 .0009$ , SN=DFC4405, ID=DFCDLS,RP=998) 
REWIND (TAPE20) 

COPY (TAPE20 . OUTPUT) 

EXIT (U) 

REWIND (TAPE7) 

ATTACH (LG02 , $LGO  FOR  FI CHE  PLOTS$,SN=DFC4405,ID=DFCDLS) 

MAP (OFF) 

LG02 . 

EXIT  (U) 

REWIND (TAPE7) 

CATALOG (TAPE7,$TAPE7  JOB  50 .0009$,SN=DFC4405, ID=DFCDLS,RP=998) 
EXIT  (U) 

DISPOSE (FICHE. *PR=CFC , ST=COMNM) 

DISPOSE (FICHE2, *FL=CFC,ST=COMNM) 

REQUEST (FRED, *PF, SN=DFC4405) 

FUSE. 

TITLE  (FICHE) 

TITLE  (FICHE2) 

TRANZP (TAPE1,FL,FID=FICHE2) 

CATALOG (TAPE 1,$ZP  FILE  JOB  50 .0009$,SN=DFC4405, ID=DFCDLS,RP=999) 
EXIT  (U) 

REWIND (OUTPUT) 

COPY (OUTPUT, FICHE) 

EXIT (U) 

REWIND (OUTPUT) 

COPY (OUTPUT, FRED) 

CATALOG (FRED, $OUTPUT  JOB  50.0009$,SN=DFC4405,ID=DFCDLS,RP=20) 
EXIT (U) 

DISPOSE (OUTPUT, SC) 

*EOS 

SHAPED  CHARGE  RUN  ...  MRL  STANDARD  38  MM  CHARGE 

$OPTNS  IFLGST=1 ,  IPMADJ=1,  ITPHSE=1,  NPRTOP=2,  TIMMAX=7200$ 

$START  PK(lj=50 .0009,  PK(2)=0,  PK(3)=0$ 

$RUN  BBAR= . 5 ,  CRATIO=l .0E04,  CVIS=-1,  CYCMX=2,  CYCPH3=-1, 

DMIN=1 .0E-03,  DTMIN=1.0E-11,  EMIN=1.0E07,  FINAL=0.4, 

IEXTX=0 ,  IGM=0,  IMAX-60 ,  IPR=35,  11=3,  12=13,  JMAX=132, 
KUNITR=7,  KUNITW=7,  LVISC=1,  MAPS=2,  MINX=0,  MAXX=40, 

MINY=27 ,  MAXY=106,  NADD=10,  NDUMP7=1,  NFRELP=20, 

NMXCLS=600 ,  NLINER=1,  NMAT=3,  NSLD=300,  NTCC=0, 

NTPMX=1800,  NTRACR=3,  PMIN=5.0E06,  PRCNT=1 .0E-03, 

PRDELT=1 . 0E-06 ,  ROEPS=l .0E-05,  SIEMIN=1.0E05,  STAB=1 .0E-03, 

ICS TOP =500$ 
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1.8034 

9.2898 
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1.905 

9.2898 
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1.905 

4  10 

9.2898 

2.2225 

9.2898 

2 

2.2225 

4  180 

9.2898 

2.2225 

2.0 

1 

2.2225 

4  10 

2.0 

1.905 

2.0 

-1 

1.905 

4  150 

2.0 

0.0 

2.0 

100 

1 

0  1 

0  350 

0 

0 

2  0 

151  0 

650 

3 

0  231 

0  380 

0 

99 

2  0.0 

2.0 

0.0 

2.0 

1.91 

8.66 

•EOS 

LIST. 

TOTAL (0 , 140000 ) 

•EOS 

TITLE l=$OUTPUT  JOB  50.0009$ 
TITLE2*$DAVID  L.  SMITH$ 
TITLE3=$ WRITTEN  "DATE"$ 
INDEX=I/l/25 
•EOS 

TITLE l=$PLOTS  FOR  JOB  50.0009$ 
TITLE2*$DAVID  L.  SMITH$ 

TITLE 3=$WRITTEN  "DATE"$ 

PLOT 

•EOP 
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APPENDIX  4 


AN  UPDATE  RUN  TO  ADD  SUBROUTINE  TCRCHK 
TO  THE  HELP  CODE 


MAKE 2 1,P 1000, T 100 . 

GETSET  DFC4405 . 

ATTACH ! OLDP L , MRLHLP  20 , SN=DF C440 5 , 1 D=DF CDLS . 

REQUEST, NEWPL, *PF,SN=DFC4405. 

REQUEST, LGO, *PF , SN=DFC4405 . 

UPDATE, F,N,L=A1. 

FTN,D=TAPE33, I=COMPILE , SL=0 , LCM=I . 

CATALOG , NEWPL , $MRLHLP 2 1$ , SN=DFC4405 , ID=DF CDLS , RP=998 . 

CATALOG (LGO, $LGO  FOR  MRLHLP21$, SN=DFC4405, ID=DFCDLS,RP=998) 

•EOS 

•IDENT  SPEEDUP 
•DELETE  FIT2.71 

3  DETIM,  COINC 

•DELETE  FIT2.85 

4  OUTIE (4) ,OUTKE (4) .DETIM (8001) .COINC (1800) 

•INSERT  ADDTCR.27 

NADDED=0 

•INSERT  ADDTCR.93 

NADDED=NADDED+NADD 
•INSERT  ADDTCR. 179 

WRITE (6.301) NADDED 
IF (NADDED . LE . 0 ) GOTO  300 
CALL  TCRCHK 

300  CONTINUE 

301  FORMAT (//,  IX.  15, 2X, 13H1RACERS  ADDED) 

•INSERT  MAIN. 17 

C  FILL  COINC  ARRAY  TO  CHECK  FOR  COINCIDENT  TRACERS  ON  SETUP 
C  OR  RESTART  OF  PROBLEM 
CALL  TCRCHK 
•DELETE  MOVTCR .41,56 
•DELETE  MOVTCR. 213, 217 
1505  CONTINUE 
•INSERT  MOVTCR. 222 
C  MOVE  FREE  SURFACE  TRACERS 

C  NOTE  ALL  VOID  TRACERS  ARE  NOW  FLAGGED  TO  A  TRACER  OF  ANOTHER 
C  MATERIAL.  ARRAY  COINC  HOLDS  THIS  INFO,  PACKAGE  NUMBER  AND  MATERIAL 

C  NUMBER  PACKED  INTO  ONE  WORD.  SUBROUTINE  TCRCHK  UPDATES  THIS  ARRAY 

C  AS  REQUIRED. 

IF (NV.LE .0) GOTO  2001 
DO  2000  1=1, NV 
CC=COINC (I) 

IF (CC.LE.O .0)GOTO  2000 
NPACK=  I NT ( (CC-AINT (CC) ) * 10 . 0 ) 

NTCR=  I NT (CC) 

TX (NVOID , I) =TX (NPACK , NTCR) 

TY (NVOID , I ) =TY (NPACK, NTCR) 

2000  CONTINUE 

2001  CONTINUE 
•INSERT  VDCLOS . 140 

CALL  TCRCHK 
•ADDFILE 
•DECK  TCRCHK 

SUBROUTINE  TCRCHK 
•CALL  COMDK 

C  THIS  SUBROUTINE  CHECKS  FOR  COINCIDENT  TRACERS  WITH  THE  VOID 

C  PACKAGE.  THIS  INFORMATION  IS  NEEDED  BY  SUBROUTINE  MOVTCR, 


WHICH  PREVIOUSLY  RE-CHECKED  EVERY  CYCLE,  EVEN  WHEN  NO  CHANGES 
WERE  POSSIBLE  FROM  THE  PREVIOUS  CHECKS. 

TCRCHK  IS  CALLED  BY  MAIN  AFTER  SETUP  IS  CALLED,  EITHER  FOR  A 
NEW  PROBLEM  OR  A  RESTART. 

TCRCHK  IS  ALSO  CALLED  BY  ADDTCR  WHEN  NEW  TRACERS  ARE  ADDED 
AND  BY  VDCLOS  WHEN  A  VOID  IS  CLOSED. 

WRITTEN  BY  DAVID  L.  SMITH  APRIL  1984. 


NV=NMP (NVOID) -1 
IF (NV.LE.O) RETURN 
DO  100  NN=1,NV 
COINC (NN) =0 . 0 
TXV=TX (NVOID, NN) 

IF (TXV.LE. -1000.) GOTO  100 
TYV=TY (NVOID, NN) 

DO  110  NPACK=1 ,  NMAT 
NT=NMP  (NPACK)  -1 
DO  120  NTCR=1,NT 

IFffiQ.TX(NPACK,NTCR)  . AND. TYV.EQ.TY (NPACK, NTCR )) GOTO  130 

CONTINUE 
GOTO  100 

COINC (NN)=FLOAT (NTCR) + (FLOAT (NPACK)  +  .001)/10.0 

CONTINUE 

WRITE (6, 200) 

RETURN 

FORMAT (//.34H  SUBROUTINE  TCRCHK  HAS  BEEN  CALLED,//) 
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